Online Similarity Learning for Big Data with Overfitting
نویسندگان
چکیده
منابع مشابه
Online Machine Learning in Big Data Streams
The area of online machine learning in big data streams covers algorithms that are (1) distributed and (2) work from data streams with only a limited possibility to store past data. The first requirement mostly concerns software architectures and efficient algorithms. The second one also imposes nontrivial theoretical restrictions on the modeling methods: In the data stream model, older data is...
متن کاملSemi-supervised Online Multiple Kernel Learning Algorithm for Big Data
In order to improve the performance of machine learning in big data, online multiple kernel learning algorithms are proposed in this paper. First, a supervised online multiple kernel learning algorithm for big data (SOMK_bd) is proposed to reduce the computational workload during kernel modification. In SOMK_bd, the traditional kernel learning algorithm is improved and kernel integration is onl...
متن کاملOnline Multitask Relative Similarity Learning
Relative similarity learning (RSL) aims to learn similarity functions from data with relative constraints. Most previous algorithms developed for RSL are batch-based learning approaches which suffer from poor scalability when dealing with realworld data arriving sequentially. These methods are often designed to learn a single similarity function for a specific task. Therefore, they may be sub-o...
متن کاملOnline tools for demonstration of backtest overfitting
In mathematical finance, backtest overfitting means the usage of historical market data (a backtest) to develop an investment strategy, where too many variations of the strategy are tried, relative to the amount of data available. Backtest overfitting is now thought to be a primary reason why quantitative investment models and strategies that look good on paper often disappoint in practice. In ...
متن کاملLarge Scale Online Learning of Image Similarity Large Scale Online Learning of Image Similarity Through Ranking
Learning a measure of similarity between pairs of objects is an important generic problem in machine learning. It is particularly useful in large scale applications like searching for an image that is similar to a given image or finding videos that are relevant to a given video. In these tasks, users look for objects that are not only visually similar but also semantically related to a given ob...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Big Data
سال: 2018
ISSN: 2332-7790,2372-2096
DOI: 10.1109/tbdata.2017.2688360